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LISTING OF CLAIMS 



1 . (Currently Amended) A method for coding macroblook v ideo data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, wh e r e in th e m e thod us e s a coding mod e to s e l e ct a ref e r e nce to b e us e d for 
pr e diction and r e construction at e ach macroblock of vid e o data, the method comprising 
th e st e ps of : 

encoding macroblocks of the video data t o produce a first bitstream representing a 
base layer; 

selecting a coding mode for each macroblock, the coding mode being selected 
from a group comprising at least three coding modes where each coding mode specifies a 
prediction reference and a reconstruction reference, the prediction reference determining 
a predicted macroblock for a current frame and the reconstruction reference determining 
a reference for a next frame; 

encoding each macroblocks of the video data using the coding mode selected for 
the macroblock, the encoding of the macroblockst e producinge a second bitstream 
representing one or more enhancement layers using an INTER coding mode selected 
from A the group of at least three coding modes comprising oonsisting of : 

an LPLR coding mode that specifies the prediction reference as a low 
quality reference in a previous frame and the reconstruction reference as the low 
quality reference in the previous frame; t hat encod e s macroblocks to produc e the 
s e cond bitstr e am by a pr e diction from a low quality reference in a pr e vious frame, - 
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wh e r e in a high quality ref e r e nc e of th e curr e nt fram e is r e construct e d from tho 
low quality r e f e r e nce in th e pr e vious fram e ; 

an HPHR coding mode that specifies the prediction reference as a high 
quality reference in a previous frame and the reconstruction reference as the high 
quality reference in the previous frame; that e ncod e s maoroblooks to produc e th e 
se cond bitstr e om by a pr e diction from a high quality r e f e r e nc e in a pr e vious 
fram e , wh e r e in th e high quality r e f e r e nc e in th e pr e vious fram e is us e d to 
r e construct a high quality r e ference in th e curr e nt fram e ; and 

an HPLR coding mode that specifies the prediction reference as a high 
quality reference in a previous frame and the reconstruction reference as a low 
quality reference in the previous frame, wherein the high quality reference in the 
previous frame is not the same as the low quality reference in the previous frame, 
that e ncod e s macroblocks to produc e th e second bitstr e om by a pr e diction from a 
high quality r e f e r e nc e in a pr e vious fram e , wh e r e in: 

a high quality r e f e r e nc e of the curr e nt fram e is 

r e con s truct e d from a low quality r e fer e nce in th e pr e vious fram e ; 

tXTRX 

th e high quality refer e nc e in th e previous frame is not the sam e as the low quality 

ref e r e nc e in th e pr e vious fram e . 

wherein at least two of the coding modes are utilized to produce the second 
bitstream. 

2. (Currently Amended) The method as set forth in Claim 1, wherein: 
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encoding macroblocks to produce the first and second bitstreams further 
comprises estimating drifting information that occurs from the encoding; and 

the INTER coding mode is selected from the group based upon a derivation that 
uses the estimated drifting information to maximize coding efficiency while minimizing 
drifting error. 
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3. (Currently Amended) The method as set forth in Claim 1, wherein: 

encoding macroblocks to produce a the first bitstream representing the a base 
layer comprises motion compensating an original image using the base layer as a 
reference to form a low quality predicted image in the pixel domain xt>; 

encoding macroblocks to produce a the second bitstream representing one or more 
enhancement layers using one of the an INTER coding modes further comprises motion 
compensating an original image using the enhancement layer as a reference to form a 
high quality predicted image in the pixel domain Xe; 

transforming the low quality predicted image in the pixel domain Xb to form low 
quality predicted coefficients Xb; 

quantizing the low quality predicted coefficients to form quantized coefficients; 

dequantizing the quantized coefficients to form dequantized coefficients X b ; 

inverse transforming the dequantized coefficients X b to form inverse dequantized 
coefficients as a reconstructed base layer in pixel the domain x b ; 

taking the absolute value of the difference between x b and: 

the base layer of the motion compensated original image in the pixel 
domain Xb to form a first value; and 

the high quality predicted image in the pixel domain x e to form a second 

value; 

selecting the LPLR mode when the first value less than or equal to the second 
value; and 

selecting the HPLR mode or the HPHR mode when the first value greater then the 
second value. 
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4. (Original) A method as recited in Claim 1, further comprising: 
using variable length coding to compress the first and second bitstreams; 

transmitting the compressed first and second bitstreams over a network; 

decompressing and decoding the first bitstream representing the base layer into 
the video data; and 

decompressing and decoding the second bitstream representing one or more 
enhancement layers into the video data. 

5. (Original) A method as recited in Claim 4, further comprising 
reconstructing a missing enhancement layer from one of the high and low high quality 
references in the previous frame. 

6. (Original) The method as defined in Claim 1, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 
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7. (Currently Amended) A method for coding maoroblock v ideo data 
according to layered coding techniques in which the maoroblock v ideo data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, the method comprising th e st e ps of : 

motion compensating an original image using the base layer as a reference to 
form a low quality predicted image; 

transforming the low quality predicted image to form low quality predicted 
coefficients; 

quantizing the low quality predicted coefficients to form quantized coefficients; 
dequantizing the quantized coefficients to form dequantized coefficients; 
inverse transforming the dequantized coefficients to form inverse dequantized 
coefficients in the form of a reconstructed base layer in pixel the domain x 6 ; 
taking the absolute value of the difference between x b and: 

the base layer of the motion compensated original image in the pixel 
domain Xb to form a first value; and 

one said enhancement layer of the motion compensated original image in 
the pixel domain x e to form a second value; 

selecting the LPLR mode when the first value less than or equal to the second 
value; and 

selecting the HPLR mode or the HPHR mode when the first value greater then the 
second value. 
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8. (Original) The method as defined in Claim 7, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 

9. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to 
perform the methodsteps of Claim 7. 
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10. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to: 

encode macroblocks of video data according to layered coding techniques in which 
the macroblock video data is represented as multi-layered frames, each frame having a 
plurality of references in multiple layers ranging from a base layer of low quality to 
enhancement layers of increasingly higher quality, including encoding macroblocks to 
produce a first bitstream representing a base layer, and encoding macroblocks to produce 
a second bitstream representing one or more enhancement layers using a n INTER coding 
mode selected from the a group comprising at least three coding modes, the group 
including consisting of : 

an LPLR coding mode that encodes macroblocks to produce the second 

bitstream by a prediction from a low quality reference in a previous frame, 

wherein a high quality reference of the current frame is reconstructed from the 

low quality reference in the previous frame; 

an HPHR coding mode that encodes macroblocks to produce the second 

bitstream by a prediction from a high quality reference in a previous frame, 

wherein the high quality reference in the previous frame is used to reconstruct a 

high quality reference in the current frame; and 

an HPLR coding mode that encodes macroblocks to produce the second 

bitstream by a prediction from a high quality reference in a previous frame, 

wherein: 

a high quality reference of the current frame is reconstructed from a 
low quality reference in the previous frame; and 
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the high quality reference in the previous frame is not the same as the 
low quality reference in the previous frame. 

1 1 . (Original) A computer-readable medium as recited in claim 10, further 
having instructions that direct a computer to store the base layer and the one or more 
enhancement layers in memory. 

12. (Original) A computer-readable medium as recited in claim 10, further 
having instructions that direct a computer to: 

transmit the base layer over a network; and 

transmit the one or more enhancement layers over the network according to 
bandwidth availability on the network. 

13. (Original) A computer-readable medium as recited in claim 10, further 
having instructions that direct a computer to recover the video data from the base layer 
and any of the one or more enhancement layers. 

14. (Original) A computer-readable medium as recited in claim 10, further 
having instructions that direct a computer to reconstruct a missing enhancement layer 
from an enhancement layer of a reference reconstructed frame. 
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15. (Currently Amended) A video coding system to encode macroblocks of 
video data according to layered coding techniques in which the macroblock video data is 
represented as multi-layered frames, each frame having a plurality of references in 
multiple layers ranging from a base layer of low quality to enhancement layers of 
increasingly higher quality, the video coding system comprising: 

a base layer encoder to encode macroblocks of the video data t o produce a first 
bitstream representing a base layer; and 

an enhancement layer encoder to encode macroblocks of the video data t o produce a 
second bitstream representing one or more enhancement layers using an INTER coding 
mode selected from the a group comprising at least three coding modes, the group 
including consisting of : 

an LPLR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a low quality reference in a previous frame, 
wherein a high quality reference of the current frame is reconstructed from the 
low quality reference in the previous frame; 

an HPHR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein the high quality reference in the previous frame is used to reconstruct a 
high quality reference in the current frame; and 

an HPLR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein: 

a high quality reference of the current frame is reconstructed from 
a low quality reference in the previous frame; and 
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the high quality reference in the previous frame is not the same as the 
low quality reference in the previous frame. 

16. (Original) An operating system comprising the video coding system of 
claim 15. 
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1 7. (Currently Amended) A video coding system for encoding maoroblocks of 
video data according to layered coding techniques in which the maoroblock v ideo data is 
represented as multi-layered frames, each frame having a plurality of references in 
multiple layers ranging from a base layer of low quality to enhancement layers of 
increasingly higher quality, the video coding system comprising: 

means for encoding macroblocks of the video data to produce a first bitstream 
representing a base layer; and 

means for encoding macroblocks of the video data to produce a second 
bitstream representing one or more enhancement layers using an INTER coding 
mode selected from athe group including consisting of : 

an LPLR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a low quality reference in a previous frame, 
wherein a high quality reference of the current frame is reconstructed from the 
low quality reference in the previous frame; 

an HPHR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein the high quality reference in the previous frame is used to reconstruct a 
high quality reference in the current frame; and 

an HPLR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein: 

a high quality reference of the current frame is reconstructed from 
a low quality reference in the previous frame; and 
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the high quality reference in the previous frame is not the same as the 
low quality reference in the previous frame. 

1 8. (Original) An operating system comprising the video coding system of 
claim 17. 
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19. (Currently Amended) A video delivery architecture, comprising: 
a content provider having a video storage to store video data, a video server to 
serve the video data over a network, a base layer encoder, and an enhancement layer 
encoder, the video server being configured to encode macroblocks of video data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, wherein: 

the base layer encoder encodes macroblocks to produce a first bitstream 
representing a base layer; and 

the enhancement layer encoder encodes macroblocks to produce a second 
bitstream representing one or more enhancement layers using an INTER coding 
mode selected from the a group comprising at least three coding modes, the group 
including consisting of : 

an LPLR coding mode that encodes macroblocks to produce the 
second bitstream by a prediction from a low quality reference in a 
previous frame, wherein a high quality reference of the current frame is 
reconstructed from the low quality reference in the previous frame; 

an HPHR coding mode that encodes macroblocks to produce the 
second bitstream by a prediction from a high quality reference in a 
previous frame, wherein the high quality reference in the previous frame is 
used to reconstruct a high quality reference in the current frame; and 
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an HPLR coding mode that encodes macroblocks to produce the 
second bitstream by a prediction from a high quality reference in a 
previous frame, wherein: 

a high quality reference of the current frame is 

reconstructed from a low quality reference in the previous frame; 

and 

the high quality reference in the previous frame is not the same 
as the low quality reference in the previous frame; 
a client configured to receive the encoded video data served from the content 
provider, the client being configured to decode the video data. 

20. (Original) A video delivery architecture as recited in claim 19, wherein: 
the video server transmits the encoded video data as composing the base layer and the 

one or more of the enhancement layers; and 

the client decodes the video data from the base layer and the one or more 
enhancement layers. 

21. (Original) A video delivery architecture as recited in claim 19, wherein the 
client reconstructs an enhancement layer in a particular frame from an enhancement layer 
of a reference reconstructed frame. 
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22. (Currently Amended) A method for coding maoroblock v ideo data 
according to layered coding techniques in which the maoroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, wh e r e in th e method us e s a coding mode to s e l e ct a r e f e r e nc e to b e us e d for 
pr e diction and r e construction both at th e bas e lay e r and at th e e nhancem e nt lay e r, the 
method comprising: 

encoding macroblocks of the video data t o produce a first bitstream representing a 
base layer; 

encoding macroblocks of the video data t o produce a second bitstream 
representing one or more enhancement layers using an INTER coding mode selected 
from athe group including consisting of : 

a BHPLR coding mode that encodes macroblocks to produce the second 

bitstream by a prediction from a high quality reference in a previous frame, 

wherein: 

a low quality reference of the current frame is reconstructed from, 
and is of the same quality as, a low quality reference in the previous 
frame; and 

a high quality reference of the current frame is reconstructed from, 
and is of the same quality as, the high quality reference in the previous 
frame; 

a BHPHR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein: 
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a low quality reference of the current frame is reconstructed from 
the high quality reference in the previous frame; and 

a high quality reference of the current frame is reconstructed from, 
and is of the same quality as, the high quality reference in the previous 
frame. 



23. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to 
perform the method s teps-of Claim 22. 



24. (Original) The method as defined in Claim 22, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 
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25. (Currently Amended) A video delivery architecture, comprising: 
a content provider having a video storage to store video data, a video server to 
serve the video data over a network, a base layer encoder, and an enhancement layer 
encoder, the video server being configured to encode macroblocks of video data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, wherein: 

the base layer encoder encodes macroblocks to produce a first bitstream 
representing a base layer; and 

the enhancement layer encoder encodes macroblocks to produce a second 
bitstream representing one or more enhancement layers using an INTER coding 
mode selected from athe-group including consi s ting of : 

a BHPLR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein: 

a low quality reference of the current frame is reconstructed from, 
and is of the same quality as, a low quality reference in the previous 
frame; and 

a high quality reference of the current frame is reconstructed from, 
and is of the same quality as, the high quality reference in the previous 
frame; 
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a BHPHR coding mode that encodes macroblocks to produce the second 
bitstream by a prediction from a high quality reference in a previous frame, 
wherein: 

a low quality reference of the current frame is reconstructed from 
the high quality reference in the previous frame; and 

a high quality reference of the current frame is reconstructed from, 
and is of the same quality as, the high quality reference in the 
previous frame; 

a client configured to receive the encoded video data served from the content 
provider, the client being configured to decode the video data. 

26. (Original) A video delivery architecture as recited in claim 25, wherein: 
the video server transmits the encoded video data as composing the base layer and the 

one or more of the enhancement layers; and 

the client decodes the video data from the base layer and the one or more 
enhancement layers. 

27. (Original) A video delivery architecture as recited in claim 25, wherein the 
client reconstructs an enhancement layer in a particular frame from an enhancement layer 
of a reference reconstructed frame. 



Lbb & Hayes , pllc 



20 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



28. (Currently Amended) A method for coding macroblock v ideo data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames in multiple layers ranging from a base layer of low quality to 
enhancement layers of increasingly higher quality, the frames including frames n-1, n, 
and n+1, wherein frames n-1 and n+1 have a plurality of references, and wh e r e in th e 
m e thod us es a coding mod e to s e l e ct a r e fer e nce to b e u se d for pr e diction and 
r e construction both in a backward dir e ction in th e fram e s and in a forward dir e ction in 
th e fram e s, the method comprising: 

encoding macroblocks of the video data to produce a first bitstream representing a 
base layer; 

encoding macroblocks of the video data to produce a second bitstream 
representing one or more enhancement layers using an INTER coding mode selected 
from ajhe group including consisting of : 

a first coding mode that encodes macroblocks to produce the second 
bitstream for the n frame by temporal predictions from both a low quality 
reference in the n-1 frame and a low quality reference in the n+1 frame; 

a second coding mode that encodes macroblocks to produce the second 
bitstream for the n frame by temporal predictions from: 
a low quality reference in the n-1 frame; 
a low quality reference in the n+1 frame; 
a high quality reference in the n-1 frame; and 
a high quality reference in the n+1 frame; 
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wherein the low quality references in the n-1 and n+1 frames are used to 
predict only the lowest quality enhancement layer in the n frame; 

a third coding mode that encodes macroblocks to produce the 
second bitstream for the n frame by temporal predictions from: 

a high quality reference in the n-1 frame; and 

a high quality reference in the w+7 frame. 

29. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to 
perform the method s teps of Claim 28. 

30. (Original) The method as defined in Claim 28, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 
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3 1 . (Currently Amended) A video delivery architecture, comprising: 
a content provider having a video storage to store video data, a video server to 
serve the video data over a network, a base layer encoder, and an enhancement layer 
encoder, the video server being configured to encode macroblocks of video data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, the frames including frames /?-/, n, and n+1, wherein frames n-1 and n+1 have a 
plurality of references, wherein: 

the base layer encoder encodes macroblocks to produce a first bitstream 
representing a base layer; and 

the enhancement layer encoder encodes macroblocks to produce a second 
bitstream representing one or more enhancement layers using an INTER coding 
mode selected from athe-group including consisting -ef: 

a first coding mode that encodes macroblocks to produce the 
second bitstream for the n frame by temporal predictions from both a low quality 
reference in the n-1 frame and a low quality reference in the n+1 frame; 

a second coding mode that encodes macroblocks to produce the second 
bitstream for the n frame by temporal predictions from: 
a low quality reference in the n-1 frame; 
a low quality reference in the n+l frame; 
a high quality reference in the n-1 frame; and 
a high quality reference in the n+l frame; 
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wherein the low quality references in the n-1 and «+/ frames are used to 
predict only the lowest quality enhancement layer in the n frame; 

a third coding mode that encodes macroblocks to produce the second 
bitstream for the n frame by temporal predictions from: 

a high quality reference in the n-1 frame; and 
a high quality reference in the n+1 frame; 
a client configured to receive the encoded video data served from the content 
provider, the client being configured to decode the video data. 

32. (Original) A video delivery architecture as recited in claim 3 1 , wherein: 
the video server transmits the encoded video data as composing the base layer and the 

one or more of the enhancement layers; and 

the client decodes the video data from the base layer and the one or more 
enhancement layers. 

33. (Original) A video delivery architecture as recited in claim 3 1 , wherein the 
client reconstructs an enhancement layer in a particular frame from an enhancement layer 
of a reference reconstructed frame. 
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34. (Currently Ameded) A method for coding macroblock v ideo data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, the method comprising: 

encoding macroblocks of the video data t o produce a first bitstream representing a 
base layer; 

encoding macroblocks of the video data t o produce a second bitstream 
representing a plurality of high resolution enhancement layers and a plurality of low 
resolution enhancement layers using an INTER coding mode selected from athe-group 
including consisting of : 

a first coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 
both the low and high resolution components are predicted in the 
temporal domain from a low quality, low resolution reference in a 
previous frame; and 

a high resolution, high quality reference in the current frame is 
reconstructed from the low quality, low resolution reference in the 
previous frame; 

a second coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 
the low resolution components are predicted in the temporal 
domain from a low quality, low resolution reference in a previous frame; 
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the high resolution components are predicted in the temporal 
domain from a high quality, high resolution reference of the previous 
frame; and 

a high resolution, high quality reference in the current frame is 
reconstructed from, and is of the same quality as, the high quality, high 
resolution reference in the previous frame; 

a third coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 
one said low resolution enhancement layer predicts another said 
low resolution enhancement layer of higher quality in the transform 
domain; 

one said high resolution enhancement layer predicts another said 
high resolution enhancement layer of higher quality in the transform 
domain; 

one said low resolution enhancement layer predicts in the 
transform domain one said high resolution enhancement layer; 

a high resolution, high quality reference in the current frame is 
reconstructed from the low quality, low resolution reference in the 
previous frame; 

a fourth coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 
the one or more enhancement layers include a plurality of high 
resolution enhancement layers and a plurality of low resolution 
enhancement layers; 
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one said low resolution enhancement layer predicts another said 
low resolution enhancement layer of higher quality in the transform 
domain; 

one said high resolution enhancement layer predicts another said 
high resolution enhancement layer of higher quality in the transform 
domain; 

the high resolution enhancement layers are not predicted in the 
transform domain from the low resolution enhancement layers; 

the low resolution components are predicted in the temporal 
domain from a low quality, low resolution reference in a previous frame; 

the high resolution components are predicted in the temporal 
domain from a high quality, high resolution reference of the previous 
frame; and 

a high resolution, high quality reference in the current frame is 
reconstructed from, and is of the same quality as, the high quality, high 
resolution reference in the previous frame; 

a fifth coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 

the one or more enhancement layers include a plurality of high 
resolution enhancement layers and a plurality of low resolution 
enhancement layers; 

one said low resolution enhancement layer predicts another said 
low resolution enhancement layer of higher quality in the transform 
domain; 
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one said high resolution enhancement layer predicts another said 
high resolution enhancement layer of higher quality in the transform 
domain; 

the high resolution enhancement layers are not predicted in the 
transform domain from the low resolution enhancement layers; 

the low resolution components are predicted in the temporal 
domain from a low quality, low resolution reference in a previous frame; 

the high resolution components are predicted in the temporal 
domain from a high quality, high resolution reference of the previous 
frame; and 

a high resolution, high quality reference in a current frame is 
reconstructed from the low quality, low resolution reference in the 
previous frame; 

a sixth coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 

both of the low and high resolution components are predicted in 
the temporal domain in the temporal domain from a high quality, high 
resolution reference in a previous frame; and 

a low resolution, low quality reference in the current frame is 
reconstructed from a low quality, low resolution reference in the previous 
frame; 

a seventh coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in a current frame, wherein: 
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both of the low and high resolution components are predicted in 
the temporal domain from a high quality, high resolution reference in a 
previous frame; and 

a high resolution, high quality reference in the current frame is 
reconstructed from the high quality, high resolution reference in the 
previous frame. 

35. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to 
perform the method s teps of Claim 34. 

36. (Original) The method as defined in Claim 34, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 

37. (CurrentlyAmended) A method for coding macroblock video data 
according to layered coding techniques in which the macroblock v ideo data is represented 
as multi-layered frames in multiple layers ranging from a base layer of low quality to 
enhancement layers of increasingly higher quality, the frames including frames n-1, n, 
and wherein frames n-1 and n+1 have a plurality of references, the method 
comprising: 

encoding macroblocks of the video data to produce a first bitstream representing a 
base layer; 

encoding macroblocks of the video data t o produce a second bitstream 
representing a plurality of high resolution enhancement layers and a plurality of low 
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resolution enhancement layers using an INTER coding mode selected from athe group 

including consisting of : 

a first coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components in the n frame, wherein 
both the low and high resolution components in the n frame are predicted in the 
temporal domain from a low quality, low resolution reference in each of the n-1 
and n+1 frames; 

a second coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components for the n frame by temporal 
predictions from: 

a low quality, low resolution reference in each of the n-1 and n+1 
frames to produce low quality, low resolution components in n frame; and 
a high quality, high resolution reference in each of the n-1 and n+1 
frames to produce high quality, high resolution components in n frame; 
wherein one said low resolution, low quality component in the n frame is 
used for prediction in the transform domain of one said high quality, high 
resolution component in n frame; 

a third coding mode that encodes macroblocks to produce the second 
bitstream having high and low resolution components for the n frame by temporal 
predictions from: 

a low quality, low resolution reference in each of the n-1 and n+1 
frames to produce low quality, low resolution components in n frame; and 
a high quality, high resolution reference in each of the n-1 and n+1 
frames to produce high quality, high resolution components in n frame; 
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wherein the low resolution, low quality components in the n frame are not 
used for prediction in the transform domain of the high quality, high resolution 
components in n frame. 

38. (Currently Amended) A computer-readable medium having computer- 
executable instructions, which when executed on a processor, direct a computer to 
perform the st e ps method of Claim 37. 

39. (Original) The method as defined in Claim 37, wherein the layered coding 
techniques comprise a Progressive Fine Granularity Scalable (PFGS) video coding. 

40. (Original) A video delivery architecture, comprising: 

a content provider having a video storage to store video data, a video server to 
serve the video data over a network, a base layer encoder, and an enhancement layer 
encoder, the video server being configured to encode macroblocks of video data 
according to layered coding techniques in which the macroblock video data is represented 
as multi-layered frames, each frame having a plurality of references in multiple layers 
ranging from a base layer of low quality to enhancement layers of increasingly higher 
quality, the frames including frames n-1, n, and n+1, wherein frames n-1 and n+7 have a 
plurality of references, wherein: 

the base layer encoder encodes macroblocks to produce a first bitstream 

representing a base layer; and 

the enhancement layer encoder encodes macroblocks to produce a second 

bitstream representing a plurality of high resolution enhancement layers and a 
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plurality of low resolution enhancement layers using an INTER coding mode 
selected from the group consisting of: 

a first coding mode that encodes macroblocks to produce the 
second bitstream having high and low resolution components in the n 
frame, wherein both the low and high resolution components in the n 
frame are predicted in the temporal domain from a low quality, low 
resolution reference in each of the n-1 and n+1 frames; 

a second coding mode that encodes macroblocks to produce the 
second bitstream having high and low resolution components for the n 
frame by temporal predictions from: 

a low quality, low resolution reference in each of the n-1 
and n+1 frames to produce low quality, low resolution components 
in n frame; and 

a high quality, high resolution reference in each of the n-1 
and n+1 frames to produce high quality, high resolution 
components in n frame; 

wherein one said low resolution, low quality component in the n 
frame is used for prediction in the transform domain of one said high 
quality, high resolution component in n frame; 

a third coding mode that encodes macroblocks to produce the 
second bitstream having high and low resolution components for the n 
frame by temporal predictions from: 
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a low quality, low resolution reference in each of the n-1 
and n+1 frames to produce low quality, low resolution components 
in n frame; and 

a high quality, high resolution reference in each of the n-1 
and «+/ frames to produce high quality, high resolution 
components in n frame; 
wherein the low resolution, low quality components in the n frame are not 
used for prediction in the transform domain of the high quality, high 
resolution components in n frame; 
a client configured to receive the encoded video data served from the content 
provider, the client being configured to decode the video data. 

41 . (Original) A video delivery architecture as recited in claim 40, wherein: 
the video server transmits the encoded video data as composing the base layer and the 

one or more of the enhancement layers; and 

the client decodes the video data from the base layer and the one or more 
enhancement layers. 

42. (Original) A video delivery architecture as recited in claim 40, wherein the 
client reconstructs an enhancement layer in a particular frame from an enhancement layer 
of a reference reconstructed frame. 
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43. (New) A method for reducing drifting errors in a video encoding scheme 
where video data is represented as multi-layered frames, each frame having a plurality of 
references in multiple layers ranging from a base layer of a low quality to enhancement 
layers of increasingly higher quality, the method comprising: 

selectively choosing a coding mode from a plurality of coding modes for each 
macroblock in the video data, the plurality of coding modes differing with respect to 
references used for prediction and reconstruction; and 

encoding each macroblock in the enhancement layers with the chosen coding 

mode. 

44. (New) The method of claim 43, wherein at least one of the coding modes 
uses a different reference for prediction and for reconstruction. 

45. (New) The method of claim 43, wherein the plurality of coding modes 
comprise at least three INTER coding modes. 

46. (New) The method of claim 45, wherein one of the INTER coding modes 
comprises a LPLR mode in which the macroblock is predicted from a previous low 
quality reference and a high quality reference for a next frame is reconstructed from the 
same reference. 
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47. (New) The method of claim 45, wherein one of the INTER coding modes 
comprises an HPHR mode in which the macroblock is predicted from a previous high 
quality reference and reconstructed from the same reference. 

48. (New) The method of claim 45, wherein one of the INTER coding modes 
comprises an HPLR mode in which the macroblock is predicted from a previous high 
quality reference and the high quality reference is reconstructed from a previous low 
quality reference. 

49. (New) The method of claim 43, wherein the coding mode is selected 
based on an iterative drifting model that controls the drifting error in the coding scheme. 

50. (New) The method of claim 43, wherein the coding mode is selected 
based on an temporal prediction of each frame. 

51 . (New) The method of claim 43, wherein each coding mode is utilized for 
at least one of the macroblocks in the video data. 

52. (New) The method of claim 43, wherein at least two of the plurality of 
coding modes are utilized when encoding the macroblocks in the enhancement layers. 

53. (New) The method of claim 43, wherein the video encoding scheme 
comprises a progressive fine granularity scalable (PFGS) video encoding. 
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